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ABSTRACT 

We present a new algorithm (PINOCCHIO, PINpointing Orbit- Crossing Collapsed Hierarchical ob- 
jects) to predict accurately the formation and evolution of individual dark matter haloes in a given 
realization of an initial linear density field. Compared with the halo population formed in a large (360'^ 
particles) coUisionless simulation of a CDM universe, our method is able to predict to better than 10 per 
cent statistical quantities such as the mass function, two-point correlation function and progenitor mass 
function of the haloes. Masses of individual haloes are estimated accurately as well, with errors typically 
of order 30 per cent in the mass range well resolved by the numerical simulation. These results show that 
the hierarchical formation of dark matter haloes can be accurately predicted using local approximations 
to the dynamics when the correlations in the initial density field are properly taken into account. The 
approach allows one to automatically generate a large ensemble of accurate merging histories of haloes 
with complete knowledge of their spatial distribution. The construction of the full merger tree for a 256^ 
realisation requires a few hours of CPU-time on a personal computer, orders of magnitude faster than the 
corresponding Y-body simulation would take, and without needing any extensive post-processing. The 
technique can be efficiently used, for instance, for generating the input for galaxy formation modeling. 

Subject headings: galaxies: haloes - galaxies: formation - galaxies: clustering - cosmology: theory - 
dark matter 



1. INTRODUCTION 

In currently favoured dark matter dominated cosmolog- 
ical models, initially small density fluctuations arc ampli- 
fied by gravity and eventually condense out of the Hubble 
expansion to form gravitationally bound systems at a den- 
sity contrast of ^ 200 - dark matter haloes (e.g. Peebles 
1993). The properties of the halo population are of fun- 
damental importance for understanding galaxy formation 
and evolution. Indeed, galaxies are thought to form when 
baryons fall into such dark matter haloes and arc shocked 
to sufhciently high temperatures and densities that the gas 
can cool radiatively to form stars (Rees & Ostriker 1977, 
White & Rees 1978). 

The formation of haloes can be studied using numeri- 
cal simulations which usually evolve a set of equal mass 
particles that represent the dark matter in a periodic sim- 
ulation box (e.g. Efstathiou et al. 1985). A popular way 
of identifying 'haloes' in such calculations is the friends- 
of-friends (FOF) algorithm, which links particles within a 
fraction b of the mean inter-particle spacing into one halo, 

at a density contrast of ~ 1/b^. Other halo identification 
algorithms generally give similar results. Jenkins et al. 
(2001) combined the results from simulations with a vari- 
ety of box sizes to obtain the mass function n{M) of FOF 
haloes over a large dynamic range. 

Analytical descriptions of the halo formation process 
were pioneered by Press & Schechter (1974, hereafter PS) 
and were recently reviewed by Monaco (1998). Although 



the PS mass function and its extensions (the so-called ex- 
cursion set approach. Bond et al. 1991) fit the numerical 
FOF mass function reasonably well (e.g. Efstathiou et al. 
1988), there are real discrepancies both at large and small 
masses where PS respectively under and over predicts halo 
numbers (e.g. Governato et al. 1999; Jenkins et al. 2001; 
Bode et al. 2001). Similar discrepancies are found when 
reproducing the mass function of the progenitors of halos 
of given mass (Sheth & Lemson 1999; Somerville et al. 
2000). In addition. Bond et al. (1991) and White (1996) 
demonstrated that the PS approach achieves a very poor 
agreement on an object-by-object basis when compared 
to simulations (but see Sheth, Mo & Tormen 1999 for a 
different view). Analytic approaches based on the peaks 
of the initial density fields did not achieve a better agree- 
ment with simulations (Katz, Quinn & Gelb 1993). In- 
termediate between simulations and analytical techniques 
are perturbative approaches that describe the growth of 
haloes in a given numerical realisation of a linear density 
field, such as the truncated Zel'dovich (1970) approxima- 
tion (Borgani, Coles & Moscardini 1994), the peak-patch 
algorithm (Bond & Myers 1996a,b) and the merging cell 
model (Rodriguez & Thomas 1996; Lanzoni, Mamon & 
Guiderdoni 2000). 

In this paper we present a new algorithm to compute 
the formation and evolution of dark matter haloes in a 
given linear density field. A ID version of this algorithm 
was given by Monaco & Murante (2000). In common with 
the other perturbative approaches, we combine a local de- 
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scription of the dynamics in order to identify collapsed 
haloes with Lagrangian perturbation theory to displace 
the haloes to their final positions. We demonstrate that 
the algorithm leads to an accurate description of the de- 
tailed clustering and merger history of haloes while re- 
quiring several orders of magnitude less computer time 
and post-simulation analysis than the corresponding full 
blown numerical simulation. In addition, the successful 
reproduction of the merger history demonstrates that we 
have identified the key processes that govern halo forma- 
tion, and that these can be described with a perturbative 
approach. 

We follow a two step procedure that mimics the hierar- 
chical built-up of haloes through accretion and merging. 
The first step identifies orbit-crossing (hereafter OC) as 
the instant at which a mass element undergoes collapse. 
We compute OC numerically by applying local ellipsoidal 
collapse approximation to the full Lagrangian perturba- 
tive expansion (Bond & Myers 1996a, Monaco 1995, 1997). 
This part is the more computationally expensive, requiring 
several hours of computer time on a personal computer for 
a 256^ realization. The second step groups the collapsed 
particles into disjoint haloes, using an algorithm similar to 
that used to identify haloes in iV-body simulations. Ba- 
sically, a particle accretes onto a halo if it is suflaciently 
close to it at its collapse instant. We use Lagrangian 
perturbation theory (LPT, Catelan 1995, Bouchet 1996; 
Buchert 1996) to compute the positions of haloes and par- 
ticles. Seed haloes are local maxima of the collapse red- 
shift. This second step automatically determines the full 
merger history of haloes and requires negligible computer 
time. Compared to simulations, the first step determines 
when a simulation particle enters a high-density region 
whereas the second identifies the haloes. 

Since our method describes, in the linear density field, 
the hierarchical built-up of objects that have undergone 
OC, we refer to it as PINOCCHIO: PINpointing Orbit- 
Crossed Collapsed HIerachical Objects. In the next sec- 
tion we describe the algorithm in more detail. In section 

3 we compare its predictions with those from simulations 
and discuss possible applications of the method. Section 

4 gives the conclusions. Technical details and resolution 
issues are addressed in forthcoming papers (Monaco, The- 
uns & Taffoni 2001; Taff'oni, Monaco & Theuns 2001). 

2. THE ALGORITHM 
2.1. Orbit crossing 

Consider a random realization of a density field, p(q), 
where q denote Lagrangian (initial) coordinates, and let 
(f){q) be the corresponding peculiar potential. Both fields 
can be smoothed by convolving them with a Gaussian with 
FWHM R; we denote them as p{q, R) and (f){q, R) respec- 
tively. The first derivative of the potential, dq. 4>, describes 
the motion of the particle in the Zel'dovich (1970) ap- 
proximation, and the shear tensor, dq^dq.cf), can be used 
to give a description of the deformation of the mass ele- 
ment based on ellipsoidal collapse (Bond & Myers 1996a, 
Monaco 1995, 1997). In our context ellipsoidal collapse is 
a convenient truncation of LPT (Monaco 1997). 

For a given smoothing radius R, the density of a mass 
element will become infinite as soon as at least one of the 

® On the initial grid q of Lagrangian positions, the six particles nea 



ellipsoid's axes reaches zero size, at which point the rela- 
tion x(q) becomes multi-valued and the Jacobian of the 
transformation q — > x, J = det|9x/(?q| = 0. This is the 
definition of OC. We argue that after this instant tc, non- 
linear processes will become important and hence further 
predictions of what happens to the mass element can not 
be safely made using LPT. However, as the density of the 
mass element is already very high, we regard it as a candi- 
date for the building up of a collapsed halo at time tc{R)- 
A different definition of collapse was used, e.g., by Au- 
dit, Teyssier & Alimi (1997), Lee & Shandarin (1998) and 
Sheth & Tormen (2000). 

In practice we generate the density field p on a cubic 
grid. In our description, mass elements (or 'particles') 
then correspond to the grid vertices q. The potential (f) 
and its derivatives are computed from p using fast Fourier 
transforms. We typically use ~20 logarithmically spaced 
smoothing radii. Applying local ellipsoidal collapse to each 
particle, we obtain the collapse redshift on each smooth- 
ing scale, and we record for each particle the highest col- 
lapse redshift Zc, the corresponding smoothing scale Rc 
and the Zel'dovich (1970) estimates for the peculiar ve- 
locity Vc{Rc) OC V0(q, i?c) on that smoothing scale. Note 
that at this stage we make no prediction of the mass of 
the collapsed halo that the particle accreted onto. 

In fact, the collapsed mass element will not necessarly 
have accreted onto any halo, but may instead have be- 
come part of a filament or sheet (collectively referred to as 
'filaments' hereafter), since these have undergone OC as 
well. These structures trace the moderate over densities 
that connect the much higher density collapsed haloes in 
simulations. The next subsection describes how the OC 
region is divided into collapsed haloes and OC filaments. 

2.2. Fragmentation 

The grouping of OC particles into haloes mimics the hi- 
erarchical formation of objects, and also the way in which 
halo finders identify collapsed objects in simulations. We 
begin by sorting particles according to decreasing collapse 
redshift Zc, and, starting from the highest Zc we decide 
the fate of the collapsed particle, working our way down 
forward in time to the last particle to collapse. 

Briefly, at the instant the particle is deemed to collapse, 
we decide which halo, if any, it accreted onto. The candi- 
date haloes are those that already contain one Lagrangian 
neighbour of the particle^. The particle will accrete onto 
the halo if it is 'sufficiently close' to it at the collapse time, 
mimicking the construction of FOF haloes. We use the 
Zel'dovich velocities Vc as defined earlier to compute the 
distance, at the collapse time, between the particle and 
the candidate halo. If a particle has more than one can- 
didate halo, we also check whether these haloes should 
merge, using a similar merger criterion. Notice that in 
this way haloes are by construction connected regions in 
Lagrangian space. 

More in detail, we apply the following rules for accre- 
tion and merging. (Lengths arc; in units of the grid spacing; 
Rm = M^/^ is the 'radius' of a halo of M particles.) 

(1) Seed haloes Local maxima of the collapse redshift 
Zc are seeds for a new halo. 

to a given particle are its 'Lagrangian neighbours'. 
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Fig. 1. — Comparison of mass function Mn(M) in a standard CDM model (Cm = 1). Top panel: simulated mass function for FOF selected 
haloes (Full lines with Poissonian error bars), PINOCCHIO mass function (filled circles), the fit by Sheth and Tormen (short-dashed lines) 
and PS function (long-dashed lines), at redshifts z = 0, 0.43, 1.13 and 1.86 (higher redshift curves are off-set by 0.1 dcx both vertically and 
horizontally for improved clarity). Vertical lines show limits corresponding to simulation haloes with 10, 50, 100, 500 and 1000 particles (256^ 
re-sampling). Bottom panel: Difference between simulated mass function and PINOCCHIO (filled dots), Sheth and Tormen fit (short-dashed 
line) and PS (long-dashed line) at 2 = 0. 




Fig. 2. — Predicted halo mass against FOF halo mass for a subset of the particles of Govornato et al. (1999) Cm = 1 simulation at redshift 
2 = The PINOCCHIO masses are highly correlated with the FOF masses. Points which have not collapsed have been arbitrarily assigned 
a mass of 1O^^M0. 
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(2) Accretion A collapsing particle (not a local 
maximum) accretes onto a candidate halo (i.e. 
containing one of its Lagrangian neighbours) if the 
distance d, at the collapse time, between particle 
and halo ccntrc-of-mass is d < faRu- .fa is a 
parameter of order unity, analogous to the linking 
length parameter used to identiiy FOF haloes. If 
the particle is able to accrete onto two (or more) 
haloes, we assign it to the one for which d/Ru is 
the smallest. 

(3) Merging If the particle has more than one 
candidate halo, then these haloes are merged if 
their mutual distance d, again at the particle's 
collapse time, is < fmRu, whore Rm refers to 
the larger halo and /m is again a parameter of 
order unity. Since we only consider six Lagrangian 
neighbours, up to six haloes may merge at a given 
time, although binary and ternary mergers are of 
course much more frequent. 

(4) Filaments With these rules for accretion and 
merging, some collapsing particles do not accrete 
onto a halo at their collapse time. Since these 
particles tend to occur in the mildly ovcrdcnsc 
regions that connect the haloes (visible as a fila- 
mentary network between haloes in simulations), 
wc assign them to a 'filaments' group. In TV-body 
simulations, some particles accrete onto a halo 
directly from this filamentary network, without 
passing through a collapsed halo first. In order 
to account for this, we check the Lagrangian 
neighbours of a particle that accretes onto a halo 
according to the accretion condition (2). If any of 
these neighbours already belong to the filaments 
group, then they also accrete onto that halo. (So 
up to five additional particles may accrete onto 
the halo, if their common Lagrangian neighbour 
satisfies condition (2)). 

When the groups are very small, R.m is comparable to 
the grid spacing, and the Zel'dovich displacements are of- 
ten not sufficiently accurate for the accretion or merging 
condition to be fulfilled. This resolution effect results in 
producing too few small haloes at high redshift. To remedy 
this wc improve the accretion condition to rf < faRu + ./r, 
and similarly for merging. Our algorithm thus contains 
three parameters (/a, fm and /r) which need to be cali- 
brated using the FOF mass function as determined from 
a simulation, and which have obvious physical interpre- 
tations in terms of accretion, merging and resolution ef- 
fects. Optimal values are /a = 0.18, /m = 0.35 and 
/r = 0.7. These values were obtained by comparing the 
PINO C OHIO mass function with those of several simu- 
lations, including the standard SCDM one discussed be- 
low, a ACDM simulation (n„ = 0.3, CIa = 0.7, dg = 1, 
h = 0.7) run with the same simulation code and box size 
(500 Mpc) and another ACDM simulation (Qa = 0.7, 
rim = 0.3, (78 = 0.9, h = 0.65) with different resolutions 
(128^ and 256^ particles) in a smaller box of 100 Mpc, 
evolved with the P3M HYDRA code (Couchman 1991). 
The agreement between PINOCCHIO and these other 
simulations is as good as the comparison with the SCDM 



simulation described in the next section; the best fit pa- 
rameters are found to agree within ~0.01. However, at 
smaller and more non-linear scales more subtle resolution 
effects appear, which can be corrected for. These details 
will be discussed in the forthcoming paper Monaco et al. 
(2001). 

3. RESULTS 

Wo have applied PINOCCHIO to the initial conditions 
of a simulation by Govornato et al (1999). This large vol- 
ume dissipationless simulation uses 360"^ dark matter par- 
ticles and was evolved using the PKDGRAV Tree code 
(comoving box size 500 Mpc, matter density ilm = 1, 
Hubble constant Hq = 50 km s^^ Mpc"\ standard CDM 
spectrum with erg = 1). Haloes have been identified at 
several output times using a standard FOF algorithm with 
linking-length b = 0.2. PINOCCHIO is fast: resampling 
the initial conditions onto a 256'^ grid, the first stage of 
computing orbit-crossing requires ^ 6 hours of CPU time, 
the second step of identifying the haloes takes just a few 
minutes. (Timings refer to a Pentium III 450MHz per- 
sonal computer. Memory requirement in this case amounts 
to ~ 512Mbytes of RAM.) These timings should be con- 
trasted with the several hundreds of hours on 256 nodes of 
a T3E Cray supercomputer required to perform the orig- 
inal simulation. Moreover, PINOCCHIO immediately 
outputs the merger tree of each halo, which should be con- 
trasted to the complicated and expensive postprocessing 
necessary to extract merger trees from a simulation. 

One way to understand the large speed-up between an 
TV-body simulation and PINOCCHIO is that most of 
the CPU time used in the A^-body simulation is spent 
integrating the orbits for particles already inside a halo. 
These particles undergo large accelerations as they orbit 
inside the halo, and hence may require thousands of time- 
steps in order for their orbits to be integrated accurately. 
PINOCCHIO on the other hand, completely ignores par- 
ticles once they are inside a halo and so can use far fewer 
steps per particle to perform the whole simulation, since 
it only needs to compute the particle's orbit before it en- 
ters any high density region. Obviously, all information 
on the internal structure of the halo is lost in the pro- 
cess, but it is well known that several millions of particles 
are required to get the internal structure correct. (See 
the controversy about the slope of halo profiles as deter- 
mined using high-resolution coUisionless simulations, e.g. 
Ghingha et al. 2000). In the following we demonstrate 
that PINOCCHIO is indeed able to predict the merging 
and clustering properties of haloes very accurately. 

PINOCCHIO reproduces the mass function Mn{M) 
(number of objects per unit volume and unit In M) to bet- 
ter than 10 per cent at all redshifts (Figure 2), in the mass 
range in which haloes have at least ~30 particles and Pois- 
son error bars are small. To make this more evident, we 
plot in the lower panel of Figure 2 the residuals with re- 
spect to the z = FOF mass function. This level of accu- 
racy improves over the fit proposed by Sheth and Tormen 
(2000). (The PS mass function, which over (under) pre- 
dicts the number of low (high) mass objects, is shown for 
comparison as well.) 

The good agreement between halo masses is not just sta- 
tistical in nature. We have plotted in figure 2 the masses of 
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Fig. 3. — Statistics of halo overlap between PINOCCHIO and FOF objects for haloes at z = (continuous lines), z = 0.43 (dotted lines), 
z = 1.13 (short dashed lines) and z = 1.86 (long dashed lines). Upper panel: fraction /d of 'cleanly assigned' halo pairs between the two 
catalogs, as a function of mass. Middle panel: fraction /sput of FOF halos that are split in two PINOCCHIO halos. Lower panel: average 
overlap in Lagrangian space, /ov, for cleanly paired-up halos. (See text for definitions of f^i and /sput- ) As in Figure 1, vertical lines show 
limits corresponding to simulation haloes with 10, 50, 100, 500 and 1000 particles (256^ re-sampling). 
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Fig. 4. — PINOCCHIO versus FOF halo masses for objects that were cleanly assigned between the two respective catalogs. Note that each 
dot corresponds to a halo pair, which contrasts with Figure 2, where each dot refers to a random point in the initial conditions. 
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Fig. 5. — Conditional mass function of haloes with mass Mq = 5 x 10 at redshift z = 0, at the earlier redshifts 1.13 and 1.86 as 

indicated. Full lines with Poissonian error bars are for the simulation, filled circles correspond to the PINOCCHIO prediction, long-dashed 
lines are the conditional mass function from the PS theory (Bower 1991). The higher redshift results have been off-set vertically by 1 dox for 
clarity. The PINOCCHIO mass function follows the simulations significantly better than the PS one. 
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Fig. 6. — Correlation functions for haloes within a given mass range as a function of co-moving separation R, for the two redshifts indicated 
in the panels. Symbols refer to FOF selected haloes, lines to PINOCCHIO haloes. Mass ranges are 10^'* < log M/Mq < lO^*'^ (filled squares 
and full lines), and log M/Mq > lO^^'^ (open squares and dashed lines respectively). Lower mass curves have been off-set vertically by 1 dex 
for clarity. The number of contributing haloes for the lower mass range is ft: 41 xlO^ and 12 xlO^ (for increasing redshifts), for the higher 
mass range 19x10^ and 0.5x10^ 
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the haloes that a particle is assigned to, for both PINOC- 
CHIO and FOF haloes, for a random subset of particles 
drawn from the initial conditions. The correlation between 
PINOCCHIO and FOF masses is extremely tight, and is 
dramatically bettor than PS (compare with Figure 2 in 
Sheth, Mo & Tormen 2001), and also improves over peak- 
patch (Bond & Myers 1996b, their figure 11). 

Figure 2 contains outliers which correspond to particles 
that are assigned to a different halo (or are not assigned 
to a halo at all) by PINOCCHIO than by the sinndation 
(or vice versa). We have investigated in detail the typical 
overlap in the initial conditions between simulated haloes 
and those found by PINOCCHIO. Since PINOCCHIO 
refers to the same initial conditions as the simulation, we 
can detc;rmino the fraction of Lagrangian volume Vp of a 
given halo identified by PINOCCHIO that overlaps the 
Lagrangian volume Vfof of a FOF halo. In general, for 
any FOF halo, the volume VpoF niay overlap with the La- 
grangian volumes of several PINOCCHIO haloes (and 
vice versa). For example, if two PINOCCHIO haloes 
fail to merge, whereas the corresponding FOF haloes do 
merge, then the volume Vfof may be broken- up into two 
PINOCCHIO volumes Vp. We choose to pair-up two 
haloes between the two catalogs, if their Lagrangian vol- 
umes overlap to better than 30 per cent. Paired- up haloes 
are in addition called 'cleanly assigned' if the intersection 
of l^oF with Vp is larger than for any other FOF halo, and 
vice versa. Paircd-up haloes that are not cleanly assigned 
are called 'split'. Denoting the fraction of haloes that are 
cleanly assigned and the fraction that are split by /d and 
/split respectively, then the fraction of haloes that are not 
paired-up is obviously 1 - /ci - /split- 

In figure 3 we show fd and /sput as a function of halo 
mass, for several redshifts. For sufficiently massive haloes 
M > 10"Mq (corresponding to 40 particles), fd > 0.8, 
showing that most FOF haloes can be unambigously as- 
sociated with a corresponding PINOCCHIO halo, while 
the fraction of FOF haloes split in two or more PINOC- 
CHIO haloes is small. The fraction 1 — /d — /sput of FOF 
haloes that have no corresponding PINOCCHIO halo is 
very small as well, ranging from < 1 per cent for the most 
massive haloes, to ~ 15 per cent for small haloes with ~40 
particles. The latter limit is close to the minimum number 
needed to correctly numerically simulate the formation of 
a halo, given an initial density field. For cleanly assigned 
haloes, the bottom panel in the figure shows the fractional 
overlap /ov of the respective Lagrangian volumes. A typi- 
cal values for well resolved haloes is /ov ~ 0.7, indicating 
that the mass errors are usually smaller than 30 per cent. 
This is made more clear in figure 3, which compares the 
FOF with the PINOCCHIO masses for cleanly assigned 
halo pairs. The correlation is very tight. The level of 
agreement between PINOCCHIO and simulations is only 
weakly dependent on redshift. 

Since PINOCCHIO haloes are in detail very similar to 
their corresponding FOF haloes, their merging history and 
clustering properties can be expected to be very similar as 
well. The conditional mass function n{M, z\Mo, zq) (the 
number density of objects of mass M at redshift z that 
are merged in haloes of mass Mq at the later redshift zq) 
is shown in Figure 3. The PS prediction, computed fol- 
lowing Bond et al. (1991; see also Bower 1991; Lacey & 



Cole 1993) is also shown. Also in this case the agreement 
between PINOCCHIO and the simulation is very good, 
making an improvement with respect to PS and demon- 
strating that PINOCCHIO haloes undergo a very similar 
merging history as do FOF haloes. 

Finally, we compare in Figure 3 the two-point corre- 
lation function ^(r) of haloes as a function of mass and 
redshift. The agreement with the simulations is again 
very good. In particular, the high clustering amplitude 
of massive; haloes at early times is well reproduced, and 
the correlation length ro is recovered to within 10 per cent 
or better, thus improving the PS-like estimate of Sheth 
et al. (2001) and allowing to discriminate easily between 
different cosmological models (Colberg ct al. 2001). The 
quality of this agreement suggests that halo positions are 
well estimated by PINOCCHIO; we find that the ID rms 
error on the final positions is ~ 0.8 Mpc (smaller than 
the grid spacing) , while velocities are recovered with a ID 
rms of ~150 km/s. 

4. CONCLUSIONS 

We have demonstrated that PINOCCHIO is able to 
accurately describe the evolution of clustering of haloes as 
a function of mass. Therefore, when combined with semi- 
analytical models for galaxy formation (White & Frenk 
1991, Kauffmann, White & Guiderdoni 1993, Cole et al 
1994, Somerville & Primack 1999), PINOCCHIO can be 
used to reliably generate mock galaxy catalogs, with the 
correct evolution of galaxy clustering build-in, while re- 
quiring orders of magnitude less computer time than nu- 
merical simulations. Easy and accurate production of large 
halo catalogues is invaluable for interpreting data and es- 
timating errors from galaxy or galaxy cluster surveys, for 
example when studying galaxy bias (Diaferio et al. 1999; 
Benson et al. 2000), estimating power-spectra (e.g. Efs- 
tathiou & Moody 2001), determining shear from weak lens- 
ing measurements (van Waerbeke et al. 2000, Wittman et 
al. 2000, Bacon, Refregier & Ellis 2000, Kaiser, Wilson 
& Luppino 2000) or studying intrinsic galaxy alignments 
(Crittenden et al. 2001, Brown et al. 2000). 

A more detailed and technical account account of the 
code, suitable for those who wish to use it, will be given 
in a forthcoming paper (Monaco et al. 2001), while the 
ability of predicting halo merger histories beyond the pro- 
genitor mass function will be presented by Taffoni et al. 
(2001). A public version of PINOCCHIO is available at 
the site http://www.daut.univ.trieste.it/pinocchio. 
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